A Connectionist Approach to Propositional Phrase Attachment for Real World Texts

نویسندگان

  • Josep M. Sopena
  • Agustí Lloberas
  • Joan López-Moliner
چکیده

Ill this paper we describe a neural network-based approach to prepositional phrase attachment disam-biguation for real world texts. Although the use of semantic classes in this task seems intuitively to be adequate, methods employed to date have not used them very effectively. Causes of their poor results are discussed. Our model, which uses only classes, scores appreciably better than the other class-based methods which have been tested on the Wall Street Journal corpus. To date, the best result obtained using only classes was a score of 79.1%; we obtained an accuracy score of 86.8%. This score is among the best reported in the literature using this corpus. 1 Introduction Structural ambiguity is one of the most serious problems faced by Natural Language Processing (NLP) systems. It occurs when the syntactic information does not suffice to make an assignment decision. Prepositional phrase (PP) attachment is, perhaps, the canonical case of structural ambiguity. What kind of information should we use in order to solve this ambiguity? In most cases, the information needed comes from a local context, and the attach-lnent decision is based essentially on the relationships existing between predicates and arguments, what Katz y Fodor (1963) called selectional restrictions. For example, in the expression: (V accommodate) (gP Johnson's election) (PP as a director), the PP is attached to the NP. However, in the expression: (V taking) (NP that news) (PP as a sign to be cautions), the PP is attached to the verb. In both expressions, the attachment site is decided on tile basis of verb and noun seleetional restrictions. In other eases, the information determining the PP attachment comes from a global context. In this paper we will focus on the disambiguation mechanism based on selectional restrictions. Previous work has shown that it is extremely difficult to build handmade rule-based systems able to deal with this kind of problem. Since such handmade systems proved unsuccessful, in recent years two main methods have appeared capable of auto-1233 matic learning from tagged corpora: automatic rule based methods and statistical methods. In this paper we will show that, providing that the problem is correctly approached, an NN can obtain better results than any of the methods used to date for PP attachment disambiguation. Statistical methods consider how a local context can disambiguate PP attachment estimating the probability from a corpus: p(verb attachlv NP1 prep NP2) Since an NP can be arbitrarily complex, the …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Aaaa Aaaa Aaaa

Alignment Stefan Wermter International Computer Science Institute 1947 Center Street Berkeley, CA 94704-1198 USA [email protected] Joseph Chen University of Hamburg Computer Science Department Vogt-K olln-Stra e 30 D-22527 Hamburg, Germany [email protected] Abstract 1We examine phrase alignment in a hybrid connectionist framework. We describe the architecture and the learn...

متن کامل

Prepositional Phrase Attachment through Semantic Association using Connectionist Approach

Determining the correct attachment site for Prepositional Phrase (PP) is one of the major sources of ambiguity in natural language parsing and analysis. In this paper, we describe a neural network based approach to prepositional phrase attachment for natural language text. Our approach disambiguates the attachment site for PP through semantic association among the constituents namely verb, noun...

متن کامل

Disambiguation of English PP Attachment using Multilingual Aligned Data

Prepositional phrase attachment (PP attachment) is a major source of ambiguity in English. It poses a substantial challenge to Machine Translation (MT) between English and languages that are not characterized by PP attachment ambiguity. In this paper we present an unsupervised, bilingual, corpus-based approach to the resolution of English PP attachment ambiguity. As data we use aligned linguist...

متن کامل

Connectionism and compositional semantics

Quite a few interesting experiments have been done applying neural networks to natural language tasks. Without detracting from the value of these early investigations, this paper argues that current neural network architectures are too weak to solve anything but toy language problems; Their downfall is the need for "dynamic inference," in which several pieces of information not previously seen ...

متن کامل

Initial Results on Hybrid Connectionist Bilingual Alignment

1 We examine phrase alignment in a hybrid connectionist framework. We describe the architecture and the learning algorithm of our approach. Simulations have been carried out to demonstrate the feasibility of this approach using two corpora. Although the results of our approach are still at an early stage, we found that a hybrid approach to phrase alignment has the potential to provide good resu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998